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Here, we present the draft genome of Enterococcus hirae INF El, found as a contaminant in cultured milk and studied for its 
ability to metabolize milk fat globule membrane glycoconjugates. 



Received 5 May 2014 Accepted 26 June 2014 Published 17 July 2014 

Citation Porcellato D, 0stlie HM, Skeie SB. 2014. Draft genome sequence of Enterococcus hirae strain INF El isolated from cultured milk. Genome Announc. 2(4):e00498-14. 
doi:l 0.1 1 28/genomeA.00498-l 4. 

Copyright © 2014 Porcellato et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license. 
Address correspondence to Davide Porcellato, davide.porcellato@nmbu.no. 



The finding of the genus Enterococcus in the cheese environment 
is controversial. Its presence is considered essential for flavor 
development in some Mediterranean cheese varieties, while in 
most other cheeses it is considered negative, as some strains of this 
genus are opportunistic pathogens (1,2). The main source of En- 
terococcus spp. in dairy products is the environmental contamina- 
tion of milk, and in case of traditional fermented cheese made 
from raw milk the presence of Enterococcus spp. has been well 
described (3, 4). Some Enterococcus strains isolated from the dairy 
environment have previously shown good adaptability to grow in 
cheese environments as nonstarter lactic acid bacteria (5). Meta- 
bolic studies have shown interesting features of a cultured milk 
isolate of Enterococcus hirae (INF El) regarding its ability to utilize 
most milk fat globule membrane monosaccharides (6) and to sur- 
vive the harsh conditions in the gastrointestinal tract (7). This 
suggests that E. hirae INF El survives in high numbers during 
cheese ripening and enters the intestine in good shape. 

Here, we present the draft genome sequence of the cultured 
milk isolate E. hirae INF El. Good-quality DNA was submitted for 
sequencing on an Illumina MiSeq platform at the Norwegian Se- 
quencing Center (University of Oslo, Oslo, Norway) with 150- 
fold coverage of the genome. High-quality reads were de novo 
assembled using CLC Genomics Workbench 5.5 (CLC bio). Con- 
tigs obtained from the assembler and with lengths over 1,000 bp 
and coverages over 20 X were oriented and ordered against the 
complete genome sequence of E. hirae ATCC 9790 using progres- 
siveMauve (8). Coding DNA sequences (CDS) were predicted and 
annotated using the NCBI Prokaryotic Genome Annotation Pipe- 
line (9). 

The draft genome of E. hirae INF El consists of 22 contigs, for 
a total of 2,807,725 bp with a GC content of 36.8%. The largest 
contig is 560,880 bp. The total number of coding sequences (CDS) 
was 2,404 and the number of RNAs was 53. A plasmid of length 
5,376 bp was found and encoded 7 hypothetical proteins. Two 
confirmed clustered regularly interspaced short palindromic re- 
peat (CRISPR) arrays were identified using CRISPRfinder (10). 

Interestingly, three gene clusters for the production of second- 
ary metabolites were identified using the Antibiotics and Second- 
ary Metabolite Analysis Shell (11). The three gene clusters encode 



proteins for the production of a putative class II lantipeptide, a 
terpene, and a bacteriocin. The construction of the metabolic 
model for E. hirae INF El will help to elucidate the possible energy 
sources and metabolic pathways present in this isolate and the 
biological importance and adaptation of this species in the dairy 
environment. 

Nucleotide sequence accession numbers. This whole-genome 
shotgun project has been deposited at DDBI/EMBL/GenBank un- 
der the accession number JMIG00000000. The version described 
in this paper is version JMIGO 1000000. 
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